Everything Totally Explained


Ask & we'll explain, totally!
Correlation does not imply causation
Totally Explained


  NEW! All the latest news in the worlds of computer gaming, entertainment, the environment,  
finance, health, politics, science, stocks & shares, technology and much, much, more.  


    View this entry using RSS
   

Everything about Correlation Does Not Imply Causation totally explained

Correlation doesn't imply causation is a phrase used in the sciences and statistics to emphasize that correlation between two variables doesn't imply that one causes the other. Its negation, correlation proves causation, is a logical fallacy by which two events that occur together are claimed to have a cause-and-effect relationship. The fallacy is also known as cum hoc ergo propter hoc (Latin for "with this, therefore because of this") and false cause. By contrast, the fallacy post hoc ergo propter hoc requires that one event occurs before the other and so may be considered a type of cum hoc. In a widely-studied example, numerous epidemiological studies showed that women who were taking combined hormone replacement therapy (HRT) also had a lower-than-average incidence of coronary heart disease (CHD), leading doctors to propose that HRT was protective against CHD. But controlled trials showed that HRT caused a small and significant increase in risk of CHD. Re-analysis of the data showed that women undertaking HRT were more likely to be from socio-economic groups ABC1, with better than average diet and exercise regimes. The two were coincident effects of a common cause, rather than cause and effect as had been supposed.

Usage

In the strictest sense, it's always correct to say "Correlation doesn't imply causation". However, the word "imply" in casual use loosely means suggests rather than requires. The idea that correlation and causation are connected is certainly true; correlation is needed for causation to be proved.
   However, in logic, the technical use of the word "implies" means » * to be a sufficient circumstance.

This is the meaning intended by statisticians when they say causation isn't certain. Indeed, p implies q has the technical meaning of logical implication: if p then q symbolized as p → q. That is "if circumstance p is true, then q necessarily follows."
   In contrast, the everyday English meaning of "imply" is » * To indicate or suggest.

To say that "Correlation doesn't suggest causation" is false: A demonstrably consistent correlation often suggests some causal relationship (or implies it, in the casual sense of the word). What correlation doesn't do is prove causation. Arguments that assert this suffer from the cum hoc ergo propter hoc logical fallacy. Edward Tufte, in a criticism of the brevity of Microsoft PowerPoint presentations, deprecates the use of "is" to relate correlation and causation (as in "Correlation isn't causation"), citing its inaccuracy as incomplete. While it isn't the case that correlation is causation, simply stating their nonequivalence omits information about their relationship. Tufte suggests that the shortest true statement that can be made about causality and correlation must be at least expanded to either » Empirically observed covariation is a necessary but not sufficient condition for causality.

or » Correlation isn't causation but it sure is a hint.

General pattern

The cum hoc ergo propter hoc logical fallacy can be expressed as follows:
  • A occurs in correlation with B.
  • Therefore, A causes B.
In this type of logical fallacy, one makes a premature conclusion about causality after observing only a correlation between two or more factors. Generally, if one factor (A) is observed to only be correlated with another factor (B), it's sometimes taken for granted that A is causing B even when no evidence supports this. This is a logical fallacy because there are at least four other possibilities:
  • B may be the cause of A, or
  • some unknown third factor is actually the cause of the relationship between A and B, or
  • the "relationship" is so complex it can be labelled coincidental (for example, two events occurring at the same time that have no simple relationship to each other besides the fact that they're occurring at the same time).
  • B may be the cause of A at the same time as A is the cause of B (contradicting that the only relationship between A and B is that A causes B). This describes a self-reinforcing system. In other words, there can be no conclusion made regarding the existence or the direction of a cause and effect relationship only from the fact that A and B are correlated. Determining whether there's an actual cause and effect relationship requires further investigation, even when the relationship between A and B is statistically significant, a large effect size is observed, or a large part of the variance is explained.

    Examples

    » Sleeping with one's shoes on is strongly correlated with waking up with a headache.


       Therefore, sleeping with one's shoes on causes headache.
       The above example commits the correlation-implies-causation fallacy, as it prematurely concludes that sleeping with one's shoes on causes headache. A more plausible explanation is that both are caused by a third factor, in this case alcohol intoxication, which thereby gives rise to a correlation. Thus, this is a case of possibility (2) above. » The more firemen fighting a fire, the more damage there's going to be.


       Therefore firemen cause damage.
       The above example is simple and easy to understand. The strong correlation between the number of firemen at a scene and the damage that's caused doesn't imply that the firemen cause the damage. Firemen are sent according to the severity of the fire and if there's a large fire, a greater number of firemen is sent. Large fires cause more damage. » With a decrease in the number of pirates we've seen an increase in global warming over the same time period.


       Therefore, global warming is caused by a lack of pirates.
       The example above is used satirically by the parody religion Pastafarianism to illustrate the logical fallacy of assuming that correlation equals causation. » Young children who sleep with the light on are much more likely to develop myopia in later life.

    The former is a recent scientific example that resulted from a study at the University of Pennsylvania Medical Center. Published in the May 13, 1999 issue of Nature, the study received much coverage at the time in the popular press . However, a later study at Ohio State University didn't find a link between infants sleeping with the light on and development of myopia. It did find a strong link between parental myopia and the development of child myopia, also noting that myopic parents were more likely to leave a light on in their children's bedroom . This is a case of (2).
       Another example: » Since the 1950s, both the atmospheric CO2 level and crime levels have increased sharply.


       Hence, atmospheric CO2 causes crime.
       The above example arguably makes the mistake of prematurely concluding a causal relationship where the relationship between the variables, if any, is so complex it may be labelled coincidental. The two events have no simple relationship to each other beside the fact that they're occurring at the same time. This is a case of possibility (3) above; another such example is the hoax Mierscheid Law.

    Determining causation

    David Hume argued that causality can't be perceived (and therefore can't be known or proven), and instead we can only perceive correlation. However, he argued that we can use the scientific method to rule out false causes.
       Intuitively, causation seems to require not just a correlation, but a counterfactual dependence. Suppose that a student performed poorly on a test and guesses that the cause wasn't studying. To prove this, we think of the counterfactual - the same student writing the same test under the same circumstances but having studied the night before. If we could rewind history, and change only one small thing (making the student study for the exam), then causation could be observed (by comparing version 1 to version 2). Because we can't rewind history and replay events after making small controlled changes, causation can only be inferred, never exactly known. This is referred to as the Fundamental Problem of Causal Inference - it's impossible to directly observe causal effects.
       A major goal of scientific experiments and statistical methods is to approximate as best as possible the counterfactual state of the world. For example, one could run an experiment on identical twins who were known to consistently get the same grades on their tests. One twin is sent to study for six hours while the other is sent to the amusement park. If their test scores suddenly diverged by a large degree, this would be strong evidence that studying (or going to the amusement park) had a causal effect on test scores. In this case, correlation between studying and test scores would almost certainly imply causation.
       Well designed statistical studies replace equality of individuals as in the previous example by equality of groups. This is achieved by randomization of the subjects to two or more groups. Although not a perfect system, placing the subjects randomly in the treatment/placebo groups ensures that it's highly likely that the groups are reasonably equal in all relevant aspects. If the treatment has a significantly different effect than the placebo, one can conclude that the treatment is likely to have a causal effect on the disease. This likeliness can be quantified in statistical terms by the P-value.

    References and notes

    Further Information

    Get more info on 'Correlation Does Not Imply Causation'.


    External Link Exchanges

    Do you know how hard it is to get a link from a large encyclopaedia? Well we're different and will prove it. To get a link from us just add the following HTML to your site on a relevant page:

      <a href="http://correlation_does_not_imply_causation.totallyexplained.com">Correlation does not imply causation Totally Explained</a>

    Then simply click through this link from your web page. Our crawlers will verify your link, extract the title of your web page and instantly add a link back to it. If you like you can remove the words Totally Explained and embed the link in article text.
       As long as your link remains in place, we'll keep our link to you right here. Please play fair - our crawlers are watching. Your site must be closely related to this one's topic. Any kind of spamming, dubious practises or removing the link will result in your link from us being dropped and, potentially, your whole site being banned.



  • Copyright © 2007-8 totallyexplained.com | Licensed under the GNU Free Documentation License | Site Map
    This article contains text from the Wikipedia article Correlation does not imply causation (History) and is released under the GFDL | RSS Version